Local protein threading by Mixed Integer Programming
نویسندگان
چکیده
During the last decade, significant progresses have been made in solving the Protein Threading Problem (PTP). However, all previous approaches to PTP only perform global sequence–structure alignment. This obvious limitation is in clear contrast with the ”world of sequences”, where local sequence-sequence alignments are widely used to find functionally important regions in families of proteins. This paper presents a novel approach to PTP which allows to align a part of a protein structure onto a protein sequence in order to detect local similarities. We show experimentally that such local sequence– structure alignments improve the quality of the prediction. Our approach is based on Mixed Integer Programming (MIP) which has been shown to be very successful in this domain. We describe five MIP models for local sequence– structure alignments, compare and analyze their performances by using ILOG CPLEX 10 solver on a benchmark of proteins. Key-words: Mixed integer programming, combinatorial optimization, protein threading problem, protein structure alignment ∗ IRISA-Symbiose team, Campus de Beaulieu, 35042 Rennes Cedex, France † Faculty of Mathematics and Informatics, Sofia University, blvd. James Bourchier 5,1164, Sofia, Bulgaria ‡ Institute of Mathematics and Informatics, Bulgarian Academy of Sciences § MIG INRA, Domaine de Vilvert, 78350 Jouy en Josas Cedex, France in ria -0 04 36 33 5, v er si on 1 26 N ov 2 00 9 Local Protein Threading by Mixed Integer Programming Résumé : Au cours des dernières décennies, le problème de la reconnaissance de repliements des protéines ou Protein Threading Problem (PTP) a connu de grandes avancées. Néanmoins, toutes les méthodes développées ne proposent que des alignements séquence–structure globaux. Or, les alignements séquence– séquence locaux ont montrés qu’ils permettaient de détecter des régions fonctionnelles dans des familles de protéines. Ce rapport présente une nouvelle approche de la reconnaissance des repliements qui permet d’aligner une partie d’une structure de protéine avec une partie d’une séquence de protéine afin de détecter des similarités locales. Nous montrons que cette approche, basée sur la programmation mixte en nombre entiers (MIP), améliore la qualité de la reconnaissance. Nous avons modélisé les alignements séquence–structure locaux par cinq modèles MIP que nous comparons et analysons grâce au logiciel CPLEX 10.0 sur un jeu de test de protéines. Mots-clés : Programmation mixte en nombre entiers, Optimisation combinatoire, Reconnaissance des repliements, Alignements séquence–structure de protéines in ria -0 04 36 33 5, v er si on 1 26 N ov 2 00 9 Local Protein Threading by Mixed Integer Programming 3
منابع مشابه
Solving the Protein Threading Problem in Parallel
We propose a network flow formulation for protein threading and show its equivalence with the shortest path problem on a graph with a very particular structure. The underying Mixed Integer Programming (MIP) model proves to be very appropriate for the protein threading problem–huge real-life instances have been solved in a reasonable time by using only a Mixed Integer Optimizer instead of a spec...
متن کاملProtein Threading Based on Nonlinear Integer Programming
Protein threading is a method of computational protein structure prediction used for protein sequences which have the same fold as proteins of known structures but do not have homologous proteins with known structure. The most popular algorithm is based on linear integer programming. In this paper, we consider methods based on nonlinear integer programming. Actually, the existing linear integer...
متن کاملProtein Threading by Linear Programming
Protein three-dimensional structure prediction through threading approach has been extensively studied and various models and algorithms have been proposed. In order to further explore ways to improve accuracy and efficiency of the threading process, this paper investigates the effectiveness of a new method: protein threading via linear programming. Based on the contact map model of protein 3D ...
متن کاملSufficient global optimality conditions for general mixed integer nonlinear programming problems
In this paper, some KKT type sufficient global optimality conditions for general mixed integer nonlinear programming problems with equality and inequality constraints (MINPP) are established. We achieve this by employing a Lagrange function for MINPP. In addition, verifiable sufficient global optimality conditions for general mixed integer quadratic programming problems are der...
متن کاملALTERNATIVE MIXED INTEGER PROGRAMMING FOR FINDING EFFICIENT BCC UNIT
Data Envelopment Analysis (DEA) cannot provide adequate discrimination among efficient decision making units (DMUs). To discriminate these efficient DMUs is an interesting research subject. The purpose of this paper is to develop the mix integer linear model which was proposed by Foroughi (Foroughi A.A. A new mixed integer linear model for selecting the best decision making units in data envelo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Discrete Applied Mathematics
دوره 159 شماره
صفحات -
تاریخ انتشار 2011